eTRIMS Scene Interpretation Datasets
نویسندگان
چکیده
We describe two image datasets for learning and evaluating interpretations of man-made scenes. The datasets consist of (A) 110 and (B) 200 fully annotated images from the building façade domain. Dataset A emphasises prominent object structures, and has been manually double-checked. Dataset B puts more emphasis on single object recognition on a larger set of classes, and has not been double checked. For both datasets, we define an object partonomy as a set of related labelled polygons, which are provided in an XML format based on the MIT LabelMe database. The datasets can be used as ground truth for training and evaluating object detection and classification algorithms, structure detection and classification algorithms as well as for evaluating complete interpretations of structured scenes, which makes use of the partonomy and taxonomy of objects.
منابع مشابه
Automatic Retrieval of near Photo-realistic Textures from Single Ground-level Building Images
An approach is presented for the automatic retrieval of near photo-realistic textures from single ground-level building images. First, the building facade texture is extracted using the Watershed segmentation technique, which is carried out repetitively until the most successful segment is extracted. Next, the retrieved building image is geometrically rectified in an automated way. After that t...
متن کاملStatistical Segmentation of Polarimetric Sar Data
This paper introduces an approach to the classification and interpretation of SAR data using complementary polarimetric and interferometric information. Strictly polarimetric and polarimetric interferometric data are first analyzed and classified separately. An unsupervised polarimetric segmentation, based on multivariate Wishart statistics, is applied to one of the separate polarimetric datase...
متن کامل3D Scene and Object Classification Based on Information Complexity of Depth Data
In this paper the problem of 3D scene and object classification from depth data is addressed. In contrast to high-dimensional feature-based representation, the depth data is described in a low dimensional space. In order to remedy the curse of dimensionality problem, the depth data is described by a sparse model over a learned dictionary. Exploiting the algorithmic information theory, a new def...
متن کاملA scene change detection framework for multi-temporal very high resolution remote sensing images
The technology of computer vision and image processing is attracting more and more attentions in recent years, and has been applied in many research areas like remote sensing image analysis. Change detection with multi-temporal remote sensing images is very important for the dynamic analysis of landscape variations. The abundant spatial information offered by very high resolution (VHR) images m...
متن کاملProbabilistic Scene Modeling for Situated Computer Vision
Verbal statements and vision are a rich source of information in a human-machine interaction scenario. For this reason Situated Computer Vision aims to include knowledge about the communicative situation in which it takes place. This paper presents three approaches how to achieve scene models of such scenarios combining different modalities. Seeing (planar) scenes as configurations of parts lea...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2010